59 research outputs found

    Simulation of Multiregional Population Change: An Application to the German Democratic Republic

    Get PDF
    This paper uses the multiregional demographic model to simulate to the year 2030 six scenarios of population development in the German Democratic Republic. It extends the work of the Migration and Settlement case study for the GDR by Mohs (1980) by illustrating how the model can be used as a tool for making simulations that are based on changing rates of fertility, mortality, and migration

    TechMiner: Extracting Technologies from Academic Publications

    Get PDF
    In recent years we have seen the emergence of a variety of scholarly datasets. Typically these capture ‘standard’ scholarly entities and their connections, such as authors, affiliations, venues, publications, citations, and others. However, as the repositories grow and the technology improves, researchers are adding new entities to these repositories to develop a richer model of the scholarly domain. In this paper, we introduce TechMiner, a new approach, which combines NLP, machine learning and semantic technologies, for mining technologies from research publications and generating an OWL ontology describing their relationships with other research entities. The resulting knowledge base can support a number of tasks, such as: richer semantic search, which can exploit the technology dimension to support better retrieval of publications; richer expert search; monitoring the emergence and impact of new technologies, both within and across scientific fields; studying the scholarly dynamics associated with the emergence of new technologies; and others. TechMiner was evaluated on a manually annotated gold standard and the results indicate that it significantly outperforms alternative NLP approaches and that its semantic features improve performance significantly with respect to both recall and precision

    Determination of puberty in gilts: contrast of diagnostic methods

    Get PDF
    Background Early onset of a gilt´s puberty is needed for adequate economic performance in farms, because it indicates her reproductive performance and longevity. Therefore, an effective diagnosis is needed. Our purpose was to compare different procedures (external characteristics, blood progesterone analysis and ultrasonography diagnosis) to detect puberty in 70 gilts (Topigs TN70; 240 days old) on farm conditions. Postmortem examination was the standard reference. Multiple logistic regression analysis was used to identify which combination of independent variables (predictors) best predicts the status of gilts. Results Puberty (46/70 gilts; 65.71%) was characterized by the presence of follicles larger than 6 mm, corpus albicans, corpus rubrum, and corpus luteum (postmortem examination). Vaginal length, body condition, backfat, carcass weight and progesterone blood concentration were significantly higher in pubertal than prepubertal gilts (P < 0.05). Two types of ultrasonography equipment (DELTA and W3) were compared and performed by the same senior technician (V1). The results obtained by two technicians with different levels of experience (V1 and V2, a junior technician) using W3 were also compared. Ultrasonography provided better results than other diagnostic techniques, although the effectiveness of the ultrasonography changed with technological improvements and with increased expertise of technicians. The most accurate results were found by V1/DELTA (Nagelkerke´s R2 = 0.846; Sensitivity = 0.956; Specificity = 0.958; Positive predictive value = 0.978; Negative predictive value = 0.920; Area under ROC curve = 0.957). Results using the W3 equipment could be improved when used in conjunction with vaginal length (V1; Nagelkerke´s R2 = 0.834; Sensitivity = 0.933; Specificity = 0.958; Positive predictive value = 0.977; Negative predictive value = 0.885; Area under ROC curve = 0.972) or progesterone concentration (V2; Nagelkerke´s R2 = 0.780; Sensitivity = 0.955; Specificity = 0.826; Positive predictive value = 0.915; Negative predictive value = 0.905; Area under ROC curve = 0.970). Conclusions Ultrasonography provided better results than other diagnostic techniques. The effectiveness of the ultrasonography changes with technological improvements and with increased expertise of technicians. Results using the W3 equipment could be improved when used along with vaginal length (V1) or progesterone concentration (V2). Accuracy parameters are a guide to choose puberty diagnosis, but the farms must also evaluate effect on gilts, ease and cost of administration

    Why reinvent the wheel: Let's build question answering systems together

    Get PDF
    Modern question answering (QA) systems need to flexibly integrate a number of components specialised to fulfil specific tasks in a QA pipeline. Key QA tasks include Named Entity Recognition and Disambiguation, Relation Extraction, and Query Building. Since a number of different software components exist that implement different strategies for each of these tasks, it is a major challenge to select and combine the most suitable components into a QA system, given the characteristics of a question. We study this optimisation problem and train classifiers, which take features of a question as input and have the goal of optimising the selection of QA components based on those features. We then devise a greedy algorithm to identify the pipelines that include the suitable components and can effectively answer the given question. We implement this model within Frankenstein, a QA framework able to select QA components and compose QA pipelines. We evaluate the effectiveness of the pipelines generated by Frankenstein using the QALD and LC-QuAD benchmarks. These results not only suggest that Frankenstein precisely solves the QA optimisation problem but also enables the automatic composition of optimised QA pipelines, which outperform the static Baseline QA pipeline. Thanks to this flexible and fully automated pipeline generation process, new QA components can be easily included in Frankenstein, thus improving the performance of the generated pipelines

    Improving Editorial Workflow and Metadata Quality at Springer Nature

    Get PDF
    Identifying the research topics that best describe the scope of a scientific publication is a crucial task for editors, in particular because the quality of these annotations determine how effectively users are able to discover the right content in online libraries. For this reason, Springer Nature, the world's largest academic book publisher, has traditionally entrusted this task to their most expert editors. These editors manually analyse all new books, possibly including hundreds of chapters, and produce a list of the most relevant topics. Hence, this process has traditionally been very expensive, time-consuming, and confined to a few senior editors. For these reasons, back in 2016 we developed Smart Topic Miner (STM), an ontology-driven application that assists the Springer Nature editorial team in annotating the volumes of all books covering conference proceedings in Computer Science. Since then STM has been regularly used by editors in Germany, China, Brazil, India, and Japan, for a total of about 800 volumes per year. Over the past three years the initial prototype has iteratively evolved in response to feedback from the users and evolving requirements. In this paper we present the most recent version of the tool and describe the evolution of the system over the years, the key lessons learnt, and the impact on the Springer Nature workflow. In particular, our solution has drastically reduced the time needed to annotate proceedings and significantly improved their discoverability, resulting in 9.3 million additional downloads. We also present a user study involving 9 editors, which yielded excellent results in term of usability, and report an evaluation of the new topic classifier used by STM, which outperforms previous versions in recall and F-measure

    RuBQ: A Russian Dataset for Question Answering over Wikidata

    Full text link
    The paper presents RuBQ, the first Russian knowledge base question answering (KBQA) dataset. The high-quality dataset consists of 1,500 Russian questions of varying complexity, their English machine translations, SPARQL queries to Wikidata, reference answers, as well as a Wikidata sample of triples containing entities with Russian labels. The dataset creation started with a large collection of question-answer pairs from online quizzes. The data underwent automatic filtering, crowd-assisted entity linking, automatic generation of SPARQL queries, and their subsequent in-house verification. The freely available dataset will be of interest for a wide community of researchers and practitioners in the areas of Semantic Web, NLP, and IR, especially for those working on multilingual question answering. The proposed dataset generation pipeline proved to be efficient and can be employed in other data annotation projects. © 2020, Springer Nature Switzerland AG.We thank Mikhail Galkin, Svitlana Vakulenko, Daniil Sorokin, Vladimir Kovalenko, Yaroslav Golubev, and Rishiraj Saha Roy for their valuable comments and fruitful discussion on the paper draft. We also thank Pavel Bakhvalov, who helped collect RuWikidata8M sample and contributed to the first version of the entity linking tool. We are grateful to Yandex.Toloka for their data annotation grant. PB acknowledges support by Ural Mathematical Center under agreement No. 075-02-2020-1537/1 with the Ministry of Science and Higher Education of the Russian Federation

    ПАТОЛОГИЯ, ВЫЗВАННАЯ ИМПЛАНТАТОМ: АЛГОРИТМ ОПРЕДЕЛЕНИЯ ЧАСТИЦ ПРИ ГИСТОПАТОЛОГИЧЕСКОМ ИССЛЕДОВАНИИ СИНОВИАЛЬНО-ПОДОБНОЙ ОКОЛОПРОТЕЗНОЙ мембраны (SLIM)

    Get PDF
    In histopathologic SLIM diagnostic (synovial-like interface membrane, SLIM) apart from diagnosing periprosthetic infection particle identification has an important role to play. The differences in particle pathogenesis and variability of materials in endoprosthetics explain the particle heterogeneity that hampers the diagnostic identification of particles. For this reason, a histopathological particle algorithm has been developed. With minimal methodical complexity this histopathological particle algorithm offers a guide to prosthesis material-particle identification. Light microscopic-morphological as well as enzyme-histochemical characteristics and polarization-optical proporties have set and particles are defined by size (microparticles, macroparticles and supra- macroparticles) and definitely characterized in accordance with a dichotomous principle. Based on these criteria, identification and validation of the particles was carried out in 120 joint endoprosthesis pathological cases. A histopathological particle score (HPS) is proposed that summarizes the most important information for the orthopedist, material scientist and histopathologist concerning particle identification in the SLIM.Важную роль при гистопатологическом исследовании синовиально-подобной околопротезной мембраны (SLIM), наряду с диагностикой околопротезной инфекции, играет идентификация частиц. Различия в патогенезе частиц и разнообразии материалов для эндопротезирования объясняют ту гетерогенность, которая затрудняет диагностическую идентификацию частиц. По этой причине был разработан гистопатологический алгоритм диагностики частиц, который при минимальных методологических сложностях обеспечивает идентификацию частиц материала протеза. Простые микроскопически-морфологические и энзим-гистохимические характеристики, а также поляризационно-оптические свойства позволяют определить размер частиц (микрочастицы, макрочастицы и супер-макрочастицы) и характеризовать их по дихотомическому принципу. На основании этих критериев были выполнены идентификация и аттестация частиц в 120 случаях патологической реакции на эндопротез сустава. Предложена гистопатологическая шкала частиц (HPS), которая суммирует важнейшую информацию для ортопедов, материаловедов и гистопатологов, касающуюся идентификации частиц методом SLIM

    Particle fluxes associated with mesoscale eddies in the Sargasso Sea

    Get PDF
    Author Posting. © Elsevier B.V., 2008. This is the author's version of the work. It is posted here by permission of Elsevier B.V. for personal use, not for redistribution. The definitive version was published in Deep Sea Research Part II: Topical Studies in Oceanography 55 (2008): 1426-1444, doi:10.1016/j.dsr2.2008.02.007.We examined the impact of a cyclonic eddy and mode-water eddy on particle flux in the Sargasso Sea. The primary method used to quantify flux was based upon measurements of the natural radionuclide, 234Th, and these flux estimates were compared to results from sediment traps in both eddies, and a 210Po/210Pb flux method in the mode-water eddy. Particulate organic carbon (POC) fluxes at 150m ranged from 1 to 4 mmol C m-2 d-1 and were comparable between methods, especially considering differences in integration times scales of each approach. Our main conclusion is that relative to summer mean conditions at the Bermuda Atlantic Time-series Study (BATS) site, eddy-driven changes in biogeochemistry did not enhance local POC fluxes during this later, more mature stage of the eddy life cycle (>6 months old). The absence of an enhancement in POC flux puts a constraint on the timing of higher POC flux events, which are thought to have caused the local O2 minima below each eddy, and must have taken place >2 months prior to our arrival. The mode-water eddy did enhance preferentially diatom biomass in its center where we estimated a factor of 3 times higher biogenic Si flux than the BATS summer average. An unexpected finding in the highly depth resolved 234Th data sets are narrow layers of particle export and remineralization within the eddy. In particular, a strong excess 234Th signal is seen below the deep chlorophyll maxima which we attribute to remineralization of 234Th bearing particles. At this depth below the euphotic zone, de novo particle production in the euphotic zone has stopped, yet particle remineralization continues via consumption of labile sinking material by bacteria and/or zooplankton. These data suggest that further study of processes in ocean layers is warranted not only within, but below the euphotic zone.The EDDIES project was funded by the National Science Foundation Chemical, Biological, and Physical Oceanography Programs. Additional support for HPLC pigment analysis (Dr. Charles Trees, CHORS) was provided by NASA
    corecore